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Abstract. A hereditary property of graphs is a collection of graphs which is closed 
under taking induced subgraphs. The speed of V is the function n i— ► \V n \, where V n 
denotes the graphs of order n in ?. It was shown by Alekseev, and by Bollobas and 
Thomason, that if V is a hereditary property of graphs then 

Vp\ = 2 ( 1 - 1 /r+°(l))« 2 /2 j 

where r = r(V) 6 N is the so-called 'colouring number' of V . However, their results tell 
us very little about the structure of a typical graph GeP. 

In this paper we describe the structure of almost every graph in a hereditary property 
of graphs, V . As a consequence, we derive essentially optimal bounds on the speed of 
V, improving the Alekseev-Bollobas-Thomason Theorem, and also generalizing results 
of Balogh, Bollobas and Simonovits. 



1. Introduction 

In this paper we shall describe the structure of almost every graph in an arbitrary 
hereditary property of graphs, V. As a corollary, we shall obtain bounds on the speed of 
V which improve those proved by Alekseev [I] and Bollobas and Thomason [T6] [T7] , and 
generalize a theorem of Balogh, Bollobas and Simonovits [3 [8] on monotone properties 
of graphs. We begin with some definitions. 

A collection of labelled graphs, V, is called a hereditary property if it is closed under 
re-labelling vertices, and under taking induced subgraphs. It is called monotone if it is 
moreover closed under taking arbitrary subgraphs. Note that a hereditary property may 
be characterized by a (possibly infinite) collection of forbidden induced subgraphs. 

Given a property of graphs, V, let V n = {G G V : |V(G)| = n} denote the graphs 
in V with vertex set [n]. The speed of V, introduced in 1976 by Erdos, Kleitman and 



Rothschild |21j, is the function 

n i ^ \V n \. 

The speed is a natural measure of the 'size' of a property. 
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The possible structures and speeds of a hereditary or monotone property of graphs 
have been extensively studied, originally in the special case where a single subgraph 
is forbidden, and more recently in general. For example, Erdos, Kleitman and Roth- 
schild [21] and Kolaitis, Promel and Rothschild [28] studied i^-free graphs, Erdos, Frankl 
and Rodl [20] studied monotone properties when a single graph is forbidden, and Promel 
and Steger [331 El] obtained (amongst other things) very precise results on the structure 
of almost all (induced-)C4-free and Cs-free graphs. They also were the first to define the 
following parameter of a property of graphs, known as the 'colouring number' of V, which 
will be important in what follows. 

First, for each reN and each vector v G {0, l} r , define a collection 7i(r,v) of graphs 
as follows. Let G G Tl(r, v) if V{G) may be partitioned into r sets Ai, . . . , A r such that 
G[Aj] is the empty graph if Vj = 0, and is the complete graph if Vj = 1. 

Definition. The colouring number Xci'P) °f a property of graphs, V, is defined to be 
XciV) := max [r G N : TC(r,v) C V for some v G {0, l} r }. 

The following result, proved by Alekseev [TJ and Bollobas and Thomason [TBI fTT] . 
generalizes the Erdos- Frankl- Rodl Theorem to a general hereditary property of graphs. 

The Alekseev-Bollobas-Thomason Theorem. Let V be a hereditary property of graphs, 
and suppose Xc(P) = r ■ Then 

Vp\ _ 2 ( 1 - 1 A+°(l))™ 2 /2_ 

The Alekseev-Bollobas-Thomason Theorem shows that the set of possible values for 
the 'entropy' of a hereditary property of graphs, 

ent(P) := lim -L log 2 (|P n |) 

n^oo (J 

is not continuous, but in fact undergoes a series of discrete 'jumps', from 1 — ~ to 1 — ^-j-j-, 
where r G N. However, the proofs of Alekseev and of Bollobas and Thomason tell us very 
little about the structure of a typical graph G G "P. Their theorem also gives rather weak 
bounds on the rate of convergence of the entropy as n — ► oo. 

For monotone properties of graphs, these problems were addressed by Balogh, Bollobas 
and Simonovits [7J [H [9] in a series of papers in which they proved very precise structural 
results, and obtaining correspondingly precise bounds on the rate of convergence of ent('P). 

The following theorem was the main result of [7\. Let ex(n, M) denote the usual 
extremal number of a collection of graphs Ai. 

The Balogh-Bollobas-Simonovits Theorem. LetV be a monotone property of graphs, 
with colouring number Xci'P) — r - Then there exists a family of graphs A4 (containing a 
bipartite graph), and a constant c = c(V), such that the following holds. 

For almost all graphs G G V , there exists a partition (A, Si, ... , S r ) ofV{G), such that 

(a) \A\ ^ c{V), 

{b) G[Sj] is M.-free for every j G [r], 
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and moreover 

For even more precise results see [S] and [H] . Balogh, Bollobas and Simonovits also had 
the following (unpublished) conjecture regarding the speed of hereditary graph properties. 
Let V l (n,Ai) denote the collection of induced- .M-free graphs on [n]. 

Conjecture 1 (Balogh, Bollobas, Simonovits). LetV be a hereditary property of graphs, 
and suppose that Xc{V) = r. Then there exists a family of graphs M. (with Xc(-M) = 1), 
and a constant c = c(V), such that 

2( 1 -l/r)(Z) 2 \-P i (n/r,M)\ ^ Ipl <C 2 {1 ~ 1/r) (i) ra H^W^)|+cn_ 

We remark that, by the results of Promel and Steger [331 Ell, Conjecture [T] holds for 
the properties V = {G : C 4 ^ G} and V = {G : C 5 ^ G}. (Here, and throughout, we 
write H ^ G to mean that H is an induced subgraph of G.) To be precise, they proved 
that the vertex set of almost all CVfree graphs can be partitioned into an independent set 
and a complete graph, and that for almost every C^-free graph G, either G or G has the 
following structure: the vertex set may be partitioned into two classes, V(G) = A UB, so 
that A induces a clique, and B induces a vertex disjoint union of cliques. 

Finally, we note that even more precise structural results have been obtained for hered- 
itary properties of graphs with much lower speeds, by Balogh, Bollobas and Weinre- 
ich [TUl [TTJ [T2] , and for hereditary properties of other combinatorial structures, see for 
example [21 IH El El E31 ED]- Note in particular [2], where Sauer's Lemma (which will be 
a crucial tool in our proof) is used. There has also been some important recent progress 
on hereditary properties of hypergraphs, by Dotson and Nagle [18] and (independently) 
Ishigami [22J, who (building on work of Nagle and Rodl [31] and Kohayakawa, Nagle and 
Rodl [27]) proved a version of the Alekseev-Bollobas-Thomason Theorem for fc-uniform 
hypergraphs, and by Person and Schacht [32] , who showed that almost every Fano-plane- 
free 3-uniform hypergraph is bipartite. 

2. Main Results 

In this section we state our main results. We begin with a definition. 

Definition. For each k € N, the universal graph U(k) is the bipartite graph with parts 
A = [2] k and B = [k], and edge set 

E{U(k)) = {ab : a G A,b e B and be a}. 

A graph G is said to be U(k)-free if there do not exist disjoint subsets A,Bc V(G) such 
that G[A, B] = U{k). {G[A,B\ denotes the bipartite graph induced by the pair (A, B).) 

The following theorem is the main result of this paper. 

Theorem 1. Let V be a hereditary property of graphs, with colouring number Xc{V) = r. 
Then there exist constants k = k(V) G N and e = e(V) > such that the following holds. 
For almost all graphs G G V , there exists a partition (A, Si, ... , S r ) ofV{G), such that 
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(a) \A\ ^ n 1 - 6 , 

(b) G[Sj] is U(k) -free for every j G [r]. 
Moreover 

2 (l-l/r)n 2 /2 ^ |p I ^ 2 ( 1 - 1 ^)™ 2 /2 + n 2 - £ 

for every sufficiently large n£l. 

We shall in fact prove, not just that this structural description holds for almost all 
graphs G G V, but that the number of graphs in V n for which it fails is at most 

2- n2 ~ E \V n \ 

if n is sufficiently large. The final part of Theorem [T] is an immediate consequence of 
conditions (a) and (6), and the following theorem. 

Theorem 2. For each k G N there exists e = e(k) > such that the following holds. 
There are at most 2 n E distinct U(k)-free graphs on [n]. 

The structure of the remainder of the paper is as follows. First, in Section [3], we shall 
state the main tools we shall use in the paper: these include the Szemeredi Regularity 
Lemma, the Erdos-Simonovits Stability Theorem, and Sauer's Lemma. In Section [4] we 
give a sketch of the proof of our main result in the case Xc(P) = 2, and in Section [5] 
we prove Theorem [21 and deduce our bounds on the speed of V. In Section [6] we prove 
various lemmas on U(k)-free graphs, and in Section [7] we prove Theorem [TJ In Section [8] 
we shall show how to prove even sharper results in the bipartite case, and in Section [91 
we finish by stating a couple of questions and open problems. 

3. TOOLS 

In this section we shall recall some of the important tools we shall use in order to prove 
Theorems [T] and [2J In particular, we shall recall the Szemeredi Regularity Lemma |38| . 
the Erdos-Simonovits Stability Theorem [ISIEZ], and Sauer's Lemma [SB] . 

Let G be a graph, let A,B C V(G) with A n B = 0, and let e > 0. We write d(A, B) 
for the density of the bipartite graph G[A, B}. We say that the pair (A, B) is e-regular if 

\d(A,B) -d(X,Y)\ < e 

for every X C A and Y C B with \X\ ^ e\A\ and \Y\ ^ e\B\. 

Definition. A partition A% U . . . U A^ of V(G) is said to be a Szemeredi partition of G 
for £ if | Ai| ^ ... ^ \Af-\ ^ | Ai| + 1, and all but ek 2 of the pairs (Aj, Aj) are e-regular. 

Szemeredi's Regularity Lemma (Szemeredi, 1976). Let e > and m G N. There 
exists an M = M(m, e) G N such that, given any graph G, there exists a Szemeredi 
partition of G for e into k parts, for some m ^ k ^ M. 

We shall also need the following 'refinement' version of Szemeredi's Lemma. Let G be 
a graph and P = (Ui, . . . , U-t) be a partition of V(G). A Szemeredi refinement of the 
partition P for e is a refinement of the partition P which is also a Szemeredi partition of 
G for e. 
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Szemeredi's Lemma: refinement version. Let e > and m G N. There exists an 
M = M(m, e) G N such that, given any graph G, and any partition P of V(G) into 
at most m parts, there exists a Szemeredi refinement of P for e into k parts, for some 
m ^k < M. 

Let T r (n) denote the Turan graph, and t r (n) = e(T r (n)) the Turan number, as usual. 

Erdos-Simonovits Stability Theorem (Erdos, Simonovits, 1968). For eachr G N and 
e > 0, there exists a 5 > such that the following holds. Let G be a graph, and suppose 
that K r+ i <£_ G, but 

e(G) ^ t r (n) - 5n 2 . 
Then we can change G into T r (n) by switching at most en 2 edges. 

We say a set X C [n] is shattered by a family of subsets A C V(n) if for every set 
B C X, there exists aniei such that B n X = A. 

Sauer's Lemma (Sauer, 1972). Let A C V(n), and suppose that 

k-l , V 

w>E(3- 

T/ien t/iere exists a k-set X C [n] which is shattered by A. 

Given A, B C V(G), we shall also write A — > 5 to mean A 'shatters' B, i.e., that 
= U(k) for some A' C A, where fc = \B\. Note that A ->• B if and only if the 
family of sets A = {T(v) H B : v <E A} shatters B. 

Given e,5 > 0, a pair (A, B) of subsets A,B C V(G) is said to be (e, 5) -grey if the 
graph G[A, B] is e- regular and has density 5 ^ d(A, B) ^ 1 — 8. 

The following Embedding Lemma is well-known (see [29J, Theorem 2.1). 

Embedding Lemma. Let 5 > 0, and let H be a graph on r vertices. There exist e > 
and no = n (e, 5,r) G N such that the following holds. 

Let G be a graph on vertex set A\ U . . . U A r , where \Aj\ ^ n for each j G [r]. Then 
there exist distinct vertices G A k for each k G [r] such that, whenever the pair (A iy Aj) 
is (e, 5) -grey, we have 

ViVj G E(G) <=> ij G E(H). 

We shall also use the following simple result, known as the Slicing Lemma (see [29j, 
Fact 1.5). 

The Slicing Lemma. // (A, B) is (e,5)-grey and X C A, Y C B with \X\ ^ a\A\ and 
\Y\ ^ ol\B\, then the pair (X, Y) is (e', 6') -grey, where e' = 2e/a and 5' = 5/a — e. 

Finally, we make a trivial observation. 

Observation 3. Let V be a hereditary property of graphs, and suppose Xci'P) — T ■ Then 

\V n \ ^ 2 (1_1/r+o(1))n2/2 . 

Proof. By definition: there are this many graphs in TC(r,v). □ 
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As noted in the introduction, Alekseev [I] and Bollobas and Thomason [TBI [TT] inde- 
pendently proved the corresponding upper bound. 

4. Sketch of the proof 

Before proving Theorem [TJ let us a give a brief (and imprecise) sketch of the proof. For 
simplicity we shall only consider the case Xci^P) = 2. 

Let e, 6, 7, a > be sufficiently small, with e < i5 <C 7 < a, and let G G V n , where n 
is sufficiently large. We shall say that the bipartite subgraph of G induced by a pair of 
sets is grey if it is (e, <5)-grey, i.e., it is e-regular and of density between 6 and 1 — 5. We 
shall also use the following definition of a generalized universal graph: 

Let U(3, k) denote the 3-partite graph on vertex set A U B U C, where 
\ A \ = 2^1+1^ \B\ = 2 k and \C\ = k, such that B -> C and A -> B U C. 

The first step in the proof is to show that, for almost all G G V, there is a partition 
(Si, S2) of V(G) such that, 

(a) Each part is a union of (an almost equal number of) Szemeredi sets. 

(b) Each part contains at most 7m 2 grey pairs (where m is the total number of Sze- 
meredi sets). 

The proof of this (see Lemma [T6|) follows as in [7] , by applying the Szemeredi Regularity 
Lemma and Erdos-Simonovits Stability Theorem. The key observation is that the 'cluster 
graph' contains no grey triangles (see Lemma [9]). We call such a partition (Si, 5 2 ) a BBS- 
partition of G. 

Next we consider a maximal set B C V(G) of vertices such that, for each j = 1,2, and 
each pair b, V G B, 

[(r^n^A^On^)! ^ an. 

We shall sometimes refer to a set with this property as a set of 'bad' vertices. The main 
step in the proof is to show that, for almost every G G V, \B\ is bounded. 

Indeed, we show (see Lemmas dOl dH and [T2l that, for any t G N, if \B\ is suffi- 
ciently large then there exist a set B' C B of size 2 2t , and sets , . . . ,T^ C Si and 
Ti , . . . ,Tf C 5*2, with \T®\ ^ 5n, such that the following holds: 

(a) All vertices of Tj have the same neighbourhood in B' . 

(b) If b u b 2 G 5' with 61 ^ 6 2 , then r(6i) n (J 7^ T(6 2 ) n \J Tf\ (Since there are 2 2t 
vertices in B', this means they shatter any set of representatives of the sets Tj .) 

The proof of Lemma [10] uses the so-called 'sparsening method', together with a re- 
peated application of the 'reverse' Sauer's Lemma; that is, Sauer's Lemma combined 
with Lemma HI the observation that if U — > V, then V — > U' for some (large) U' C U. 

Now, suppose such a set B' exists in G. We show (see Lemma [TT]) that in almost 
every such graph we can find subsets C Tj such that, for each p, q G [t], the pair 
is grey. Hence, by the Embedding Lemma, we can find a copy of U(3, k) in 
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G, for arbitrarily large k (see Lemma [TSj) . But this is a contradiction, since Xci'P) — 2 
(see Lemma Cj). 

We have shown that \B\ is bounded for almost every G G P. Since B is maximal, it 
follows that each vertex v G V(G) is a 'clone' of some vertex b £ B with respect to one 
of the sets Sj, i.e., 

|(r(i,)ns,)A(r(&)n^)| < m . 

Since we expect to have few choices inside the sets Sj, it would be natural to expect that 
v G Sj. Although this is not necessarily true for every vertex v G V(G), it turns out 
that, for almost every graph G G P, we can make it true by 'adjusting' the partition 
(Si, S2) (see Lemmas [201 and I2TI) . We obtain a new partition, (S[, S' 2 ), which is 'close' to 
the original partition (in the sense that \SjAS'j\ ^ an), such that for each j = 1,2, and 
every v G Sj, v is a clone of some b G B with respect to Sj. 

Finally, let Uj C Sj be the vertex set of a maximal collection of disjoint copies of U(k) in 
Sj. (When Xci'P) ^ 3 this step is more complicated, see the algorithm before Lemma 123]) . 
We claim that \Uj\ = O (n 1 ^ 6 ) for almost every G G P; to prove this, we simply count 
(see Lemma 1231) . First, note that there are at most n n choices for the partition of V(G), 
and for the edges incident with B, and at most 

2a\Uj\n+n 2 - £ 

choices for the edges inside Sj. (This follows because each vertex of Uj is a clone of a 
vertex in B with respect to Sj, and by Theorem [2], using the fact that Sj \ Uj is U (fc)-free.) 
We will show further (see Lemma [TBI) that we have at most 2 ( - 1 ^ 2 ~ 2a ^ u ^ n choices for the 
edges between Uj and V(G) \ Sj. Thus the total number of choices, \V n \, satisfies 

lo&Onl) < (l^i- 1^1) (15^1 -|[/ 2 |) + Q-aVl^l + I^Dn + O^ 2 -) 

^ — - a(|C/i| + |[/ 2 |)n + 0(n 2 - £ ), 

which implies that \Ui\ + l^l = O (n 1 ^ 6 ), as required. (We have assumed for simplicity 
that \Uj\ = o(n); the calculation in the other case is essentially the same.) Letting 
A = U\ U U2 U B, we obtain Theorem [U 

5. Proof of Theorem [2] 

In this section we give a short proof of Theorem [2j Our main tool is Sauer's Lemma. 

Proof of Theorem^ Let k G N, and let G be a U(k)-free graph on n vertices. We first 
claim that, given any bipartition (A,B) of the vertex set V(G), there are at most 2 n 
choices for the cross-edges. 

Indeed, let < e < l/(k + 1), and partition B into sets Bi, . . . , B t of size about n £ . By 
Sauer's Lemma, for each j G [t] we have 

\{S C Bj : 3a G A such that T(a) D Bj = S}\ ^ k( j Bj ^ J < n efc , 
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since G[A,Bj] is U(k)-free. Thus the number of choices for G[A, B] is at most 

* /o\ B i\\ i- P 

III ) (n £k ) n ^ 2^^ B ^ n£k {2 eknXogn ) n ^ 2 n2 ~ £logn , 

since 5^ . | JB 3 -| = n and 1 + efc < 2 — e. 

Finally, we may cover E(G) with logn bipartite graphs, and so the number of choices 
for G is at most 2™ 2 £ ( lo s n ) 2 <^ 2™ 2 e for any e' < e, as required. □ 

The reader will have noticed that the value of e obtained above is not best possible; a 
more precise calculation is undertaken in Section [SJ 

6. Some lemmas on universal graphs 

In this section we state some of the lemmas we'll use to prove Theorem HJ We begin 
with a simple but key observation. Recall that we write A — > B to mean that A shatters 
B, i.e., G[A', B] = U(k) for some A' C A, where \B\ = k. 

Lemma 4. Let G be a graph and let i G N. Suppose A,Bc V(G), with A — > B and 
\B\ ^ 2*. Then there exist subsets A' C A and B' C B such that B' — * A', and \A'\ = t. 

Proof. Let A — > B be as described, and assume (taking subsets if necessary) that \B\ =2* 
and G[A, B] = U(2 t ). Label the elements of B with the vertices of the hypercube [2]* 
arbitrarily, and consider the faces of this cube (i.e., the subcubes of dimension t — 1) which 
contain the origin. Denote by Fx, . . . , F t the corresponding subsets of B. 

Now, there is an obvious bijection between vertices of A and subsets of B (a vertex 
is mapped to its neighbourhood), and so each set Fj C B corresponds to a vertex of A. 
Let A' = {(j)(Fx), . . . , 4>{F t )}. 

We claim that for each pair of vertices b, b' G B, we have T(b) C\A'^ r(6') D A'. Indeed, 
if b 7^ b' then their labels must differ in some direction on the cube, so b G Fj, b' Fj, 
say. But then <f>(Fj) G T(b) \ T(b'), as claimed. Thus A' and B' are sets as required by 
the lemma. □ 

We shall also use the following slight extension of Lemma HI 

Lemma 5. Let G be a graph and let r, t G N. Let A\, . . . , A r , B C V(G) be disjoint sets, 
with \B\ ^ 2 rt , and Aj — > B for each j G [r]. Then there exist subsets A'j C Aj and 
B' C B such that B' —> [j j A'j, and \A'j\=t for each j G [r). 

Proof. Assume (by taking a subset if necessary) that \B\ — 2 rt . By Lemma HI there exist 
subsets A* C Aj such that \A*\ = rt and B — >• A* for each j G [r]. Moreover, we can 
choose the sets A* so that the following holds: 

• Let (v^\ . . . jV^) be an arbitrary order for the elements of A*. Then, for each 
i G [rt] and j G [r], there exists a vertex Vj G A* with the same neighbourhood 
m B as v\ . 
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Indeed, to do this we simply use the same [2] ^labelling of B (see the proof of Lemma H] 
above) for each set Aj. 

Now we simply have to choose r disjoint t-subsets A'j C A* for j G [r]. To spell it out, 

let Aj = {vf : (j - l)t + 1 < i < jt} for each j G [r]. It is clear that £ -> [Jj ^j, as 
required. □ 

In SectionS] we used the 3-partite graph U(3, k). We now make the natural generaliza- 
tion to r-partite graphs, which we shall denote U(r, k). 

Definition. For each k, r G N, define the generalized universal graph, U(r,k), to be the 
r-partite graph on vertex set Ax U . . . U A r , where \Ax\ = k and |^4j+i| = 2^i for each 
1 ^ j ^ t — 1, such that 

-> A x U . . . U A,- 

for each 1 ^ j ^ r — 1. 

Moreover, for each v G {0, l} r , define U*(r, k) to be the graph on vertex set AiU. . .UA r 
such that the induced r-partite graph is U(r,k), and G[Aj] is either complete or empty 
for each j G [r], with Cr[Aj] complete if and only if Vj — 1. 

We next apply Lemma H] to prove a Ramsey-type theorem for the graphs U(r, k). 

Lemma 6. For each r,k G N ; there exists K = K(r, k) G N suc/i that the following holds. 
Let G be a graph on vertex set Ax U . . . U A r , and suppose G[Ax, ■ ■ ■ , A r ] = U(r, K). Then 
U*(r, k) ^ G for some v G {0, l} r . 

Proof. The proof is by induction on r. For r = 1 the lemma is just Ramsey's Theorem. 
For r = 2 we first apply Ramsey to Ax, to obtain a subset B\ C Ax such that G[-E>i] is 
either complete or empty. Note that A% — > Bx, so by Lemma HI there exists a set B' 2 C 
such that .Bi —>■ B' 2 . Finally, applying Ramsey to B' 2 , we obtain a subset B 2 C -B2 such 
that G[-B 2 ] is complete or empty, and Bx — > B 2 . 

So let r ^ 3, and assume the result holds for smaller values of r. Let £ = \U*,(r — 1, k)\, 
T = 2* and m = ( R y ), where -R(T) is the Ramsey number. Let if' = mk, and let 

= K(r — 1, iT'). We claim that the lemma holds for K. 

First, by the induction hypothesis, there exists a copy H of U*, (r — 1, K') in G[Ax U . . . U 
Ar-i], for some v' G {0, l}''" 1 . Note that A r -> V A ( J ff), and let = Si U . . . U £ r _i, 

where Bj + x — > Si U . . . U Bj for each 1 ^ j ^ r — 2. 

Since A r — > V(iZ'), the bipartite graph G[A r ,V(i?')] contains every (small) bipartite 
graph as an induced subgraph. We shall define a specific such bipartite graph, F, and 
show that it contains U*(r, k) for some v G {0, l} r . 

Indeed, since K' = mk, H contains m disjoint copies of U*,(r — 1, k). To see this, simply 
partition Bx into m equal-size parts, B± , . . . , B[ m \ and successively choose disjoint sets 
Bf\ S] m) C Bj such that Bf -»■ sj i} U . . . U sj l 2 r Let ffW denote the graph induced 
by Bf U . . . U S^l 

Now, define the bipartite graph F as follows. Let X(T) denote the set of subsets of 
{!,..., R(T)} of size T. 
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(a) V(F) = XUY, with X = [R(T)] and Y = (J Y z , where the sets Y z are disjoint, 

zex(r) 

and \Yz\ — t for each Z G X(T). 
(6) For each subset Wclof size T, we have — > Yw- 
(c) The other edges may be chosen arbitrarily. 

Note that no two vertices of X have the same neighbourhood in Y, since each such pair 
is contained in some T-set W C X, so differs on Y\y. Therefore, since A r — > V(H), it 
follows that there exists a set A' r C A r such that 

G[A' r ,V(H {1) )u ...UV(H {m) )] = F, 

with Y z = V(HM Z ») for each Z G J(T), for some bijection (f> : X(T) — > [m]. 

Finally we apply Ramsey's Theorem to A' r , to obtain a set -B r of size T such that G[-B r ] 
is complete or empty. By the definition of F we have B r — > Yb t , so B r — > V(iJ^) for some 
i G [m]. It follows that the set -B r U Ye r induces a copy of £7*(r, fc) for some v G {0, 
as required. □ 

The following immediate consequence of Lemma M says that if Xci'P) — r , and there is 
an arbitrarily large copy of U(r + 1, k) in G, then G contains a forbidden graph of V. 

Lemma 7. Let r,m G N, and for each v G {0, l} r choose a graph H v G 7i{r,v) with 
\V(H V )\ ^ m. Let k = k(r,m) G N be sufficiently large, and let G be any graph. 
Then either G is U(r, k)-free, or H v ^ G for some v G {0, l} r . 

Proof. Suppose that G contains a copy of U(r, k). Then, by Lemma 0, G also contains an 
induced copy of U*(r,m) for some v G {0, But H v ^ U*(r, k) for every H v G TC(r,v) 
with \H V \ ^ m, and so we are done. □ 

Before continuing, we note that we have already proved Theorem [1] in the case r = 1. 

Corollary 8. Let V be a hereditary property of graphs with Xci'P) — 1- Then there exists 
k = k(V) G N such that G is U{k)-free for every G G V , and hence there exists e > 
such that 

„2-e 



for every sufficiently large n G N. 



Proof. Since Xci'P) = 1, there exists, for each v G {0, l} 2 , a graph H v G 1~t(2, v) such that 
H v G" P. By Lemma it follows that there exists 6 N such that G is {7(fc)-free for 
every G G P. By Theorem (2], it follows that 



\V n \ ^ 2 n2 ~ E 



for every sufficiently large n G N, as required. □ 



The following 'induced' embedding lemma is a simple consequence of the Embedding 
Lemma, the Slicing Lemma and (the proof of) Lemma [71 
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Lemma 9. Given 6 > and m, r E N, there exist e > and n = n Q (e, 5, m,r) E N such 
that the following holds. Let G be a graph on A\ U . . . U A r , where \Aj\ is no /or eac/i 
j G [r] ; and suppose each pair (Ai,Aj) is (e, 5) -grey. 

Then for some v G {0, l} r , H ^ G for every H G TC(r, v) with \H\ ^ m. 

Proof. We claim that G is not C/(r, fc)-free, where k = k(r, m) is the constant in Lemma 
if n is sufficiently large. Indeed, by the Slicing Lemma, if we partition each set Aj into 

t = \U(r,k)\ almost equal sets (Aj ,...,Aj ), then each pair (A^\A^,) (with j ^ f) 
is (e', 5') -grey, where e' = 2e/t and 5' = 5/t — e. Therefore, by the Embedding Lemma, 
there exists a copy of U(r, k) in G, as required. 

But now, by the proof of LemmaO, G also contains an induced copy of U*(r, m) for some 
v G {0, l} r , and so H v ^ U*(r, k) for every H v G H(r,v) with \H V \ ^ m, as required. □ 

We next prove our key lemma whose proof uses the so-called 'sparsening' method. 

Lemma 10. For each a > and t EN, there exist c = c±(a, t) EN and 5 = S\(c, a,t) > 
such that the following holds. Let G be a bipartite graph with parts U and V, satisfying 
\U\ ^c, \V\ = n E N and 

|r(u)Ar(0| ^ an 

for each u,u' E U with u ^ u' . 

Then there exists a subset U' C U, with \U'\ = t, and sets Ti,...,T 2 t C V, with 
\Tj\ ^ Sn for each j E [2*], such that the following holds: 

(a) Ifu,ve Tj then T(u) n U' = T(v) n U' . 

(b) IfW— {w%, . . . , w 2 t}, where Wj E Tj for each j E [2*], then W —>U'. 

Proof. Assume that \U\ = c ^ ( 51ogc ) 2 , and with foresight, let p = °^ c - \y e c l a i m 

a an 
that there exists a subset X C V, with |X| = pn, such that 

v(x)nx ^ r(i/)ni 

for each x,y E U with x ^ y. Indeed, if we choose a random subset X C V of size pn, 
then 



p(r W nx = r W nx) < ("-")/(") * (1 _ a - 

V / \ pn J / \pn J 

for each such pair {x, y}, and so 



pn 



E 



{rr,3/}e [ U \ : r(x) n x = r(y) n X 



( ' C/ ' )(1 ^ c 2 e - pQn < I . 



by our choice of p. 

Thus such a set X must exist, as claimed. Now, since c is sufficiently large so that 



c £ \Xf = (pnf = 



2< 
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then, by Sauer's Lemma, there exist sets U* C U and X* C X, with |X*| ^ 2*, such that 
U* — > X*. Thus, by Lemma HI there exist sets U\ C U* and Xi C X*, with \Ui\ = t, such 
that X 1 ^U 1 . 

Now, let us remove X% from V, and repeat the process, obtaining disjoint sets X 2 , . . . , Xg. 

CtTl 

Since \Xj\ = 2*, we can do this so long as £ ^ ot+T" * s eas ^ ^° see n °thing goes 
wrong in the calculation above when we replace a by a/2.) By the pigeonhole principle, 
there is a set U' C U which occurs (as the set Uj) at least 

\IJ\Y 1 an 

t J 2 t+1 C t 

times. Let J = {j : Xj -> [/'}, and note that \ J\ ^ <Jn. Write X,- = {^(1), . . . ,x j (2 t )}, 
where T(xj(k)) f] U' = T(x f (k)) n C/' for every j, j' G J", and let T fc = {^-(/c) : j G J}, 
for each G [2*]. The sets Ti, . . . ,T 2 t satisfy conditions (a) and (6), as required. □ 

The corresponding result for r-partite graphs follows as an easy corollary. 

Lemma 11. For each a > and r,t G N, there exist c = C2{a,r,t) G N and 5 = 
82(0, a, r,t) > such that the following holds. Let G be a graph onneN vertices, let 
(Si, . . . , S r ) be a partition ofV(G), and let B C V(G) satisfy \B\ ^ c and 

\(T(b) nSj)A(r{V) nSj)\ > an 

for every j G [r], and each b,b' G B with b 7^ V . 

Then there exists a subset B' C B, with \B'\ = t, and sets T± \ . . . , Tit C Si for each 
i G [r], with |T- | ^ 5n ; such that the following holds: 

(a) Ifu,ve Tf then T(u) H B' — T(v) fl B' . 

(b) IfW = {wi, . . . , w 2 t}, where Wj G Tj 1} U . . .urn for each j G [2*], then W -> £' . 

Proof. Let ti > £2 > • • • > t r +i — t be a sequence satisfying i,- ^ ci(a,i, + i) for each 
j G [r] (where ci is the function in Lemma [TU1) . and assume that \B\ = c ^ ti. Applying 
Lemma [TU1 (with t = ti) to the pair (B,Si) = (U,V), we obtain a set Bi G B and 
a collection of sets T^\...,T^ C Si given by that lemma. In particular, we have 
\B\\ ^ t 2 and |T- | ^ 5n, where 5 = <5i(ti,a,t : 



2J 



Similarly, for each g G [2, r] we may apply Lemma [TU] (with c = t q and t = t q+ i) to the 



pair (Bq-i, S q ), to obtain sets 5^ C B q -i and T-j , . . . , T^ C S 3 , with \B q \ = t q+ i and 



|T- | ^ 5n, where 5 = 5i(t q ,a,t q+ i). Let 5 2 (c, ct,t) = mm q {5i(t q ,a,t q+ i)}. 

Finally, for each q G [r], re-number so that the sets T± , . . . , T 2 t shatter £> r C It 

follows that the sets 5' = B T and {T- : i G [r], j G [2*]} are those required by the lemma, 
and so we are done. □ 



We shall in fact use the following immediate corollary of Lemmas and [TTJ 
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Lemma 12. For each a > and r,t G N, there exist c = c^(a,r,t) G N and 5 = 
8s(c,a,r,t) > such that the following holds. Let G be a graph on n G N vertices, let 
(Si, . . . , S r ) be a partition ofV(G), and let B C V(G) satisfy \B\ ^ c and 

\(T(b)nS j )A(Y(b')nS j )\ > an 

for every j G [r], and each b,b' G B with b ^ V . 

Then there exists a subset B' C B, with \B'\ = 2 rt , and sets T± \ . . . , C Si for each 
i G [r], with |T^| ^ 5n, such that the following holds: 

(a) Ifu,ve Tf then T(u) D B' = T(v) D B' . 

(b) IfW = { Wn, . . . , w rt }, where Wij G Tj for each i G [r], j G [t], i/ien -B' — ► W . 

Proof. First we apply Lemma [Til to get sets (for each % G [r] and j G [2' B '']) which 
shatter S'. Applying Lemma [5] to these sets (or, if the reader prefers, to an arbitrarily 
chosen element from each set) gives the required sets. □ 

The following two observations will be useful in Section [71 

Observation 13. Let 5 > be sufficiently small, and let \A\ = \B\ = n. There are at 
most 2 &n2 bipartite graphs on AU B of density at most 5 2 . 

Proof. If G is such a graph then e(G) ^ m = 5 2 n 2 , so the number of choices is at most 




as required, since (l/x 2 ) x — * 1 as x — > 0. □ 

Recall that K T (t) denotes the Turan graph on rt vertices, i.e., the complete r-partite 
graph with t vertices in each part. 

Observation 14. For each r, t G N, there exist e > and hq = no(r, t) G N such that the 
following holds. Let G be an r-partite graph on vertex set AiVJ. . .UA r and, for each j G [r], 
let Bj(l) U . . . U Bj(t) be an equipartition of Aj. Suppose n ^ hq, \Ai\ = . . . = \A r \ = n, 
ande(G)^(l-e)Qn 2 . 

Then there exists a copy H of K r (t) in G with \H D Bj(k)\ ^ 1 for each j, k. 

Proof. Since e may be chosen so that er 3 t 3 1, the result is trivial by the greedy 
algorithm. To spell it out, for each i,j there exists a vertex v G Bj such that T(v) avoids 
at most \By \/rt vertices of each other set B^, \ □ 

Finally, we prove the following easy lemma, which bounds the number of ways in which 
a copy of U(r, k) in G can be attached to the rest of the graph without creating a copy 
oiU(r + l,k). 
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Lemma 15. For each r,k G N, there exists K = K(r, k) G N and S = S(r, k) > such 
that the following holds. Given a vertex set A U B, with \A\ = \U(r, k)\ and \B\ = n, let 

g(r, k,n) := {GL4, B] : 3 &U(r+ 1, jfe)-free graph G on A U B with G[A] = U(r, jfe) J, 

the set of bipartite graphs on AU B which do not create a copy of U(r + 1, k). Then 

\G(r,k,n)\ < 2 |l/(r ' fc)|n - 5 ". 

Proof. Since G is U(r + 1, fc)-free, no set X C B shatters A. This mean that there exists 
a subset Y C A such that r(u) D A ^ Y for every u G -B. We therefore have at most 

(2 |A| - l)™ ^ 2 |l/(r ' fc)|n " <5?1 

choices for the edges of G[A, B], where S = 2~' c ^ r ' fc )', as required. □ 

7. Proof of Theorem [1] 

In this section we shall describe several 'bad' properties of a graph, and prove that 
the number of graphs in a hereditary property of graphs V with one of these properties 
is od'Pnl). We then deduce Theorem [1] by observing that all remaining graphs have the 
required structure. 

We begin with an important definition, motivated by [7]. 

Definition. Let e, 5, 7 > 0, let r G N, let G be a graph, and let V(G) = S\ U . . . U S r be a 
partition of V(G). We say that P = (Si, . . . , S r ) is a BBS-partition of G (for (e, 5, 7)) if 
there exists a Szemeredi partition of G (for e) into m parts (for some \ je < m G N) such 
that: 

(a) Each part 5^ is a union of (an almost equal number of) Szemeredi sets. 

(b) Each part 5^ contains at most 7m 2 pairs which are (e, 5)-grey. 

Note that if (Si, . . . , S r ) is a BBS-partition of G for (e, 5, 7), then it follows that 

i-ejn < |S,| ^ (^ + ^)n 

for each j G [r], by condition (a). 

The following lemma was proved (in the monotone case) by Balogh, Bollobas and 
Simonovits [7j using the Szemeredi Regularity Lemma, the Erdos-Simonovits Stability 
Theorem, and the Embedding Lemma. The proof is essentially the same in our case, but 
for the sake of completeness we shall give a fairly complete sketch. 

Lemma 16 (Balogh, Bollobas and Simonovits [7]). Let r G N, let 7 > ; and let 5 = 

5(7, r) > and e = e(5, 7, r) > be sufficiently small. Let V be a hereditary property of 
graphs with Xc(P) — r ■ 

For almost every graph G G V , there exists a BBS-partition of G for (e, 5, 7) into r 
parts. 
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Proof. First, since XciP) — r , it follows that for each v G {0, l} r+1 there exists a 'for- 
bidden' graph H v G H(r + l,v), such that H v G" V. Choose such a graph for each 
v G {0, l} r+1 , and let t = max{\V(H v )\ : v G {0, l} r+1 }. 

Let G G V n , with neN sufficiently large, and apply the Szemeredi Regularity Lemma 
(for e and m — 1/e) to the graph G. We obtain a collection of k parts, B\, . . . , B^, define 
a graph H on k vertices by letting ij G E(H) if and only if the pair (Bi, Bj) is (e, <5 2 )-grey. 
(This is called the cluster graph of G.) 

Suppose first that K r+1 C H. Then, applying Lemma EH we deduce that H ^ G for 
every H G TC(r + 1, v) with \H\ ^ £, for some v G {0, l} r+1 . Therefore H v G "P, which is 
a contradiction. 

Suppose next that the number of edges in the cluster graph H satisfies 

(i-i -*)(*). 

It is easy to bound the number of graphs G with at most this many edges. Indeed, there 
are at most n n ways of choosing the Szemeredi partition, and, by Observation [13] and our 
choice of m, at most 2^ e+5 ^ n ways of choosing the edges inside the parts, and between 
non-grey pairs. Moreover, there are at most 2( 1 ~ 1 / r ~ 25 ) n I 2 ways of choosing the edges 
between grey pairs. But 

\V n \ > 2 (1 ~ 1/r+o(1))n2/2 , 

by Observation [31 so the number of such graphs Gonri vertices is o(|7-n|). 

Hence we may assume that H is iT r+1 -free, and has at least (1 — 1/r — 25)(„) edges. 
By the Erdos-Simonovits Stability Theorem, it follows that we can change H into the 
Turan graph T r (k) by changing at most jk 2 edges. But this is exactly the definition of a 
BBS-partition, and so we are done. □ 

Next, we need to count those graphs which have large 'irregularities' between pairs 
(Si, Sj) of their BBS-partition. The following definition is designed to allow us to take 
advantage of the 5n-sets given by Lemma O 

Let £, 5, 7, a > 0, let n G N, and let V be a hereditary property of graphs. To simplify 
the notation in what follows, we shall suppress dependence on e, 5 and 7. Define a set 
A(V n , a) C V n as follows: 

A(V n , a) := {G G V n : 3 a BBS-partition (Si, . . . , S r ) of G for (e, 5, 7), and sets X C Sj 
and Y C Sj, for some i 7^ j, with |X|, \Y\ ^ an, such that d(X, Y) ^ (S, 1 — S)j. 
The following lemma says that the collection A(V n , a) is small. 

Lemma 17. Let a > 0, let 2 ^ r G N, and Zei £ > ; 5 > and 7 > be sufficiently 
small. Let V be a hereditary property of graphs with Xci'P) — r > an d let n EN. Then 

\A(V n ,a)\ ^ 2 ( 1 - 1 A0n 2 /2-^/3 = 

Proof. For the first inequality we simply count. We have at most n n choices for the BBS- 
partition Si U . . . U S r , and the sets X C Si and Y C Sj. By Observation [131 an d the 
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definition of a BBS-partition, we have at most 

2(e+V6+~/)n' 2 

choices for the edges inside the set Sp., for each k G [r]. 

Next, recall that (1/r — e)n ^ |S^| ^ (1/r + e)n for each I G [r], by the definition of a 
BBS-partition, and so we have at most 

2(l/r+£) 2 n 2 

choices for the edges between S p and S q , for each p ^ q. Moreover, we have at most 

2{l/r+s) 2 n 2 -(l-VS)a 2 n 2 +l 

choices for the edges between Si and Sj. To see this, assume for simplicity that \X\ = 
\Y\ = an, and observe that we have at most 2^ 1//r+e_a ) n choices for the edges between 
Si\X and Sj \ Y, at most 2( 1 / r+e_a ) cm choices for the edges between X and Sj \ Y (and 
similarly for those between Y and Si \ X), and, by Observation [T3l at most 2 v ^ a2n2+1 
choices for the edges between X and Y. 
Putting these bounds together, we obtain 

log 2 (|^(P n ,a)|) < Q(i/ r + £ )V-^ + 0((£ + V5 + 7 )n 2 ). 

The first inequality now follows if e, 5 and 7 are sufficiently small. The final inequality 
follows by Observation [3l □ 

Next, given a graph G, and a partition P = (Si, . . . , S r ) of G, we say that a set of 
vertices B C V(G) is a-bad for (G,P) if 

|(r(M)nS' j )A(r(t;)nS , i )| ^ an 

for each u,v G -B with u ^ v, and each j G [r]. Let 

B(G,P,a) := maxjl^l : B C V(C?) is a-bad for (G,P)}. 

Now, given e, 5, 7, a > 0, c, n G N, and a hereditary property of graphs V, we define a set 
B(V n , a, c) C "P n as follows: 

B(V n , a, c) := {G eV n : 3& BBS-partition P of G for (e, 5, 7) with B(G, P, a)^c}. 

We next show that, if c = c(P) is sufficiently large then the collection B(V n , a, c) is small. 

Lemma 18. Let a > and r G N, and let V be a hereditary property of graphs with 
Xc(P) — r - There exist constants c = c(V,a) G N and a' = a'(V,a) > such that the 
following holds. Let e > 0, 5 > and 7 > be sufficiently small, and let n G N be 
sufficiently large. Then 

B{V n ,oc,c) c A(P n ,a'). 
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Proof. First, choose a graph H v G H(r + 1, v) \V for each v G {0, l} r+1 . Such graphs must 
exist because Xc(P) = r. Let m = max{\H v \ : v G {0, l} r+1 }, and let k = k(r,m) G N be 
the constant in Lemma [71 

Now, let t = t(a,r,k) G N be sufficiently large, let c = 03(0;, r,t) and a = 5s(c,a,r,t) 
be the constants in Lemma [12l and let a' = a/M(riQ,e), where M(.,.) is the constant 
in the refinement version of Szemeredi's Lemma, and n = n (r,t) is the constant in 
Observation [TU 

Let G G V n \ A(V n ,a'), let e, 5 and 7 be sufficiently small, and suppose that there 
exists a BBS-partition P of G for (e, 5, 7) such that B(G, P, a) ^ c, i.e., there exists a set 
B C V(G), with |S| ^ c, which is a-bad for (G,P). 

Claim : G is not U(r + 1, /c)-free. 

Proof of claim. By Lemma fl2l there exists a set 5' C -B, with \B'\ = 2 rt , and disjoint 
sets C S^, with \Tj\ = an for each i G [r] and j G [£], such that 

(a) If m, f G T/ } then r(it) H B' — T(v) n 5'. 

(6) If = {ifii, ■ ■ ■ j ^rt}, where Wij G T^. for each i G [r], j G [t], then S' — > II 7 . 

Let T = Uij^j^) anc ^ a PPly the refinement version of Szemeredi's Lemma (for e) to the 
(equi-) partition 

i=ij=i 

of T. We obtain, for each i G [r] and j G [i], a partition (Z7- (1), . . . , Z7- (m)) of T^, 
such that the resulting partition of T is a Szemeredi partition. Moreover, by our choices 
of constants above, we have m ^ no(r, t) and | ^ c/n for every i G [r] and j G [i]. 
Suppose first that there exists a pair (Uf(£), U^P (£')), where i ^ i', which is e-regular 

but not (e,£)-grey. Then the graph G[Lf } (f), C/jf } (f )] has density in [0,5) U (1 - 5,1], 
and so G G ^(T^, a'), a contradiction. 

Thus every e-regular pair (U^\l),Up\l')) is also (e, 6) -grey. By the definition of a 
Szemeredi partition, at most e(mrt) 2 pairs are irregular, and so the number of pairs 
(Uf(£), Up(£')) with % ^ i! which are not (e, 5)-grey is at most (1 - 2s) Q (mt) 2 . 

We apply Observation [Ml to the r-partite graph F where V(F) = {Uj(£) : % G [r],j G 

[£],•£ G [rn,]}, and a pair of vertices {Uj l \£) , Up (£')} (with 2 7^ z') is an edge of F if 
and only if they form an (e, 5)-grey pair. Since we chose e > sufficiently small, and 
m ^ no, it follows that there exist representatives {Wj : i G [r],j G [£]}, where H^- G 

{£/^(l), . . . , Uj % \rn)}, such that every pair (W^ , wjl ^) with i ^ i' is (e, <5)-grey. 

It follows, by the Embedding Lemma, that there exists a copy of U(r, k) C G with 
exactly one vertex in each set Wj . But £?' shatters this copy of Z7(r, fc), by condition (6) 
above, and so G is not Z7(r + 1, k)-free, as claimed. □ 
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By Lemma [7] and our choice of A;, it follows that H v ^ G G V for some forbidden graph 
H v , which is a contradiction. Thus B(V n ,a,c) C A(V n ,a'), as required. □ 

Given a > 0, a graph G on n vertices, a subset A C V(G) and two vertices w, u G V(Cr), 
we say that w is an a-clone of t> with respect to A if 

|(r(u)nA)A(i»nA)| ^ an. 

Let P = (Si, . . . , S r ) be a partition of G, and let B be an a-bad set for (G, P). If B is 
chosen to be maximal, then for every vertex v G V(G), there exists a vertex b £ B, and 
an index j G [r] such that t> is an a-clone of b with respect to Sj. Define 

j(v) := min [j £ [r] : 3 b G B such that v is an a-clone of b with respect to Sj}. 

Note that the function j(.) in fact depends on the triple (P,B,a). It will usually be 
obvious which partition P, set B and constant a > we are using, so we suppress this 
dependence. When it is not obvious from the context, we shall clarify. 

The following observation is an immediate consequence of the definition of an a-clone, 
together with Observation [T51 

Observation 19. Let a > be sufficiently small, let G be a graph on n vertices, let 
A,Bd V(G), and suppose v G V(G) is an a-clone of some vertex in B with respect to 
A. Then, given the edges of G[A, B], we have at most 

\B\2^ n 

choices for the edges between v and A. 

We would like to have v G Sjm for every v G V(G). In fact we shall prove that, for 
almost every graph G G V, we can adjust any given BBS-partition of G to guarantee that 
this holds. 

First, we shall show that almost all graphs G G V n have at most an/2 vertices with 
v ^ Sj( v -j. Indeed, given a > 0, a graph G, a BBS-partition P = (Si, . . . , S r ) of G, and a 
maximal a-bad set B for (G, P), let 

J(G,P,B,a) := {veV(G) : v £ S j{v) }. 

Now, given n G N, a, e, 5, 7 > and a hereditary property of graphs V, let 

C(V n , a) := |(j G V n ■ 3 a BBS-partition P of G for (e, 5, 7) and a maximal 

(2a)-bad set 5 for (G,P) such that | J(G, P, B, 2a) | ^ an}. 

The next lemma says that the set C(V n , a) is small. 

Lemma 20. Let r G N, and /e£ V be a hereditary property of graphs with Xc{P) — r - Let 
a > be small, and let e, 5, 7 > be sufficiently small, and n G N be sufficiently large. 
Then, 

\C{V n ,a)\ < 2 (1 - 1/r)n2/2 - an2/3r3 = o{\V n \). 
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Proof. The proof is almost the same as that of Lemma [T71 Indeed, let G G C(V n , a), and 
let P = (Si, . . . , S r ) be a BBS-partition of G for (e, 5, 7), and 5 C V(G) be a maximal 
(2a)-bad set for (G,P), such that | J(G, P, B, 2a>)\ ^ an. By the pigeonhole principle, 
there exists i, j G [r] (with i 7^ j) such that 

|C| :j(v)=j}\ > a'n, 

where a' = a/r 2 . 

Now we simply count the graphs in C(V n ,a). We have at most n n choices for the 
partition P, the set B, the index j and the set C. By Observation [131 and the definition 
of a BBS-partition, we have at most 

2(e+V8+~/)n 2 

choices for the edges inside the set Sf., for each k G [r]. 

Next, recall that (1/r — e)n ^ jS^I ^ (1/r + £)n for each G [r], by the definition of a 
BBS-partition, and so we have at most 

2(l/r+£) 2 n 2 

choices for the edges between S p and S q , for each p ^ q. Moreover, we have at most 

2(l/r+e) 2 n 2 -(a' /2r)n 2 

choices for the edges between Si and Sj. Indeed, by Observation [19] we have at most 
n 2v2o 7 |c|™ choices for the edges between C and Sj, and we have at most 2( 1 < /r+£ ) 2n2_ l c 'l' 1 ' /r 
choices for the edges between Si\C and Sj. 
Putting these bounds together, we obtain 

log 2 (|^(P n ,a)|) ^ Q(i/ r + e )V- + O ((e + Vd + j)n 2 ) . 

The first inequality now follows if e, 5 and 7 are sufficiently small. The final inequality 
follows by Observation [3J □ 

Now let G be a graph, let r G N, and let a, a', e, 8, 7 > 0. Given a BBS-partition 
P = (Si, . . . , S r ) of G for (e, 5, 7), and a maximal (2a)-bad set for (G, P), we make the 
following definition. 

Definition. An a- adjustment of [G, P) with respect to B is a partition P' = (SJ, . . . , S' r ) 
of V(Cr) such that, for each j G [r], the following holds. |SjASj| ^ an, and for every 
v E Sj, there exists 6 G B such that f is a (3a)-clone of b with respect to Sj. 

Given n G N, constants e, 5, 7, a > 0, and a hereditary property of graphs V, define 

T>(V n ,a) := |(3 G P n : 3 a BBS-partition P of G for (e, 5, 7) and a maximal (2a)-bad 

set B for (G, P) such that ^ an a-adjustment of (G, P) with respect to P j. 
The next lemma follows easily from the definitions. 



20 



NOGA ALON, JOZSEF BALOGH, BELA BOLLOBAS, AND ROBERT MORRIS 



Lemma 21. Let r G N, and let V be a hereditary property of graphs with Xc(P) = r ■ Let 
a > 0, and let e,5, 7 > be sufficiently small. Then, 

V(V n ,a) C C{V n ,a). 

Proof. Let G G V n , and suppose G G V(V n , a) \ C(V n , a). Let P = (Si, . . . , S r ) be an 
arbitrary BBS-partition of G for (e,5, 7), and let B be a maximal (2a)-bad set B for 
(G, P). Note that, since G C(V n , ®), we have 

\J(G,P,B,2a)\ = \{veV(G) : u £ S j{v) }\ < an. 

For each j G [r], let = {v G : j(^) = j}- We claim that P' = (S[, . . . , S' r ) 

is an a-adjustment of (G,P) with respect to B. Indeed, since \J(G, P, B,2a)\ ^ an, it 
follows immediately that l^-A^I < an for every j G [r]. Moreover, for each f G 5^ we 
have = j, and so there exists b <E B such that f is a (2a)-clone of b with respect to 
Sj. But IS'jAS'jl < an, so f is a (3o;)-clone of b with respect to Sj. 

Thus P' is an a-adjustment of (G, P) with respect to B, as claimed. But P and B 
were chosen arbitrarily, so this contradicts the fact that G G V(V n , a). Thus V(V n , a) C 
C(P n ,o;), as required. □ 

Finally, for each graph G on vertex set Si U . . . U S r , and each integer k G N, we choose 
a collection of vertex-disjoint copies of U(t, k) for each 2 ^ t ^ r + 1, using the following 
algorithm. 

Algorithm. Set £ := 1, t := r + 1 and X = 0. Repeat the following steps until t — 1. 

1. Suppose there exists a copy PT of P(t, A;) in G — X, and a function i : [£] — > [r] 
such that: 

(a) V(if ) = Ax U . . . U A u 

(b) A j+1 -> Ai U . . . U Aj for each j e [t - 1], 

(c) C Si(j) \ X for each j G [t], and 

(d) i(l) = i(2), and = i(f) & j = f for j,f > 2. 

Then set U e := V(H), X := X U U t and i := £ + 1, and repeat Step 1. 

2. Otherwise, set t := t — 1, and go to Step 1. 

In other words, we first find a maximal collection of vertex-disjoint copies of U(r + 1, k) , 
such that for each copy, the smallest two classes are in the same set = Si(2)) as each 
other, and the other classes in different sets (S^), . . . , Sit r+ u). We then find a maximal 
collection of vertex-disjoint copies of U(r,k), which are also disjoint from each of the 
copies of U(r + l,k). We repeat this for each 2 < £ < r + 1, in decreasing order. 

We obtain from the algorithm a collection {U±, . . . , U^}, where G[Ue] = U (£, k) for some 
2 ^ £ ^ t + 1, and the sets Ue are pairwise disjoint. The following observation describes 
the key property of these sets. 
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Observation 22. Let U\, . . . ,Ul be the sets obtained from the algorithm applied (for k) 
to the graph G and partition (Si, . . . , S r ) of V(G). Then, for each j £ [r] and £ £ [L\, if 

£ 

Ug fl Sj = then the set Sj \ Uj does not shatter Ue- 

3=1 

l 

Proof. Suppose G[Ue] = U(t,k), and A t+ \ — > Ue for some A t+1 C \\JUj. Then 

3=1 

Ug U A t+ i induces a copy of U(t + 1, k), and so this set would have been chosen at an 
earlier step of the algorithm. □ 

Now, given k £ N, a graph G, a BBS-partition P of G for (e, 5, 7), a maximal (2a)-bad 
set B for (G, P), and an a-adjustment P' = (S[, . . . , S' r ) of (G, P) with respect to 5, let 

L 

U(G,P',k) := |J^> 
1=1 

where {U\, . . . ,Ul} are the sets given by the algorithm, applied to the graph G and the 
partition P' . 

Given n, k £ N, constants e, 5, 7, a > 0, and a hereditary property of graphs V, let 
U(V n , a, k) := |g £ "P n : 3 a BBS-partition P of G for (e, 5, 7), a maximal (2a)-bad 

set B for (G, P), and an a-adjustment P' = (S[, . . . , S^) of 
(G, P) with respect to B with |C/(G, P', k)\ ^ n 1 ^ 



Theorem [T] is an easy corollary of the following lemma, together with Lemmas 
and [5TJ The proof of the lemma uses Theorem |5J and Lemmas [TJ [13 [T71 [THl [2D] and [2TJ 

Lemma 23. Let r £ N, and /ei V be a hereditary property of graphs with Xc{P) = f. 
There exist k = k(V) £ N and a = a(k,V) > such that the following holds. Let 
e, 5, 7 > be sufficiently small and n £ N be sufficiently large. Then 

\U(V n ,a,k)\ ^ 2 ( 1 ~ 1 /'> 2 / 2 - Q2 " 2 ~ a = o{\V n \). 

Proof. Let k be sufficiently large so that G is U{r + 1, A;) -free for every G £ P. Such a 
= A:(P) exists by Lemma [71 
We simply count the graphs in 

U n := U(V n ,a,k)\(B(V n ,a,n 1 - 2a )UV(V n ,a)y 

By Lemmas [T7] and [TS] we have 

\B(V n ,a,c)\ < 2( 1 ~ 1 ^ n2 / 2 - a ' n2 

for some a' = a'(V,a), if c = c(P, a) is sufficiently large. (Recall that \B(V n ,a,c)\ is 
monotone decreasing in c.) Also, by Lemmas [201 and [2T1 we have 

|P(P n ,a)| < 2 ^- 1 ^ n2 / 2 - a " n \ 
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where a" = a/3r 3 . Thus it suffices to prove the claimed bound for the set U n . 
So let G eW n , and note that G has 

(a) a BBS-partition P for (e, 5, 7), 

(b) a maximal set B C V(G), which is (2a)-bad for (G,P), with |P| ^ n 1-2 ", 

(c) an a-adjustment P' = (S^, . . . , S' r ) of (G, P) with respect to B, such that 

\U(G,P',k)\ > n l - a . 

Let U\, . . . , Ul denote the sets given by the algorithm, applied (for k) to the partition P' 
of G. By definition, 

U(G,P',k) = [jUj. 

j 

We have at most n n choices for the partition P', and the sets B and C/i, . . . , Ul- Now, 
given an edge e = afe, define the index i(e) as follows: 

(a) If e has an endpoint in B then z(e) = 0. 

(b) If e has an endpoint in Ug and the other endpoint is in 

e-i 

V(G) \ (B U (J ty) 
i=i 

then z(e) = I. 

(c) If e has both endpoints in V{G) \ (b U C/(G, P', fc)) then i(e) = 00. 

We choose the edges of G in increasing order of index. 

First, since \B\ ^ n l ~ 2a , we have at most 2' B ' n ^ 2 n Q choices for the edges incident 
with B. So let 1 ^ £ ^ L, and suppose that G[Ug] is a copy of U(t, k), where 2 ^ t ^ r. 
(Note that t 7^ r + 1, since G is U(r + 1, k)-free by our choice of k.) 

Claim : There is a constant A > 0, depending only on k and r, such that we have at most 

2 (l-l/r-X)\U l \n 

choices for the edges with index £. 

Proof. Without loss of generality, let U e = A x U . . . U A t , where A 1 ,A 2 C S' 2 , Aj C £>j for 
each 3 < j ^ i, and 

-»• Ai U . . . U A,- 

for each 1 ^ j ^ i — 1. Recall that, by the definition of an a-adjustment, each vertex 
u G S'a is a (3a)-clone of 6 with respect to SL for some b E B. Note also that (l/r—2a)n ^ 
liSyl ^ (1/r + 2a)n for each j G [r]. 
Thus, for each u G Z7^, we have at most 

2(l-2/r+2VH)n 

choices for the edges between « and V(G)\S[, by Observation [19| since a > is sufficiently 
small. But S[ does not shatter Ue, by Observation [221 an d so, by Lemma fT5l we have at 
most 

2\Ut\n/i — Ain 
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choices for the edges between Ue and S[, where Ai = \i(k,r) > is the constant in 
Lemma fl5l Choosing a = a(V, k) sufficiently small, the result follows. □ 

Now, let Tj = Sj \ U(G, P',k) for each j G [r] , and note that, since the algorithm 

stopped, Tj is U (fc)-free. Thus, by Theorem |2j we have at most 2 n2 2a choices for the 
edges inside these sets. Also, trivially, we have at most 

2 (l-l/r)(n-C/(G,P',fc))72 

choices for the edges between the sets Tj. 
Multiplying the number of choices, we get 

log2 (| W „|) , (, _ i) Ml! + (l - i - A) MG.P'Mn + o(„" 

< (i 4) t - - + °(»"0- 

Since |P(G, P', Jfe)| ^ n 1 "", the result follows if \U(G,P',k)\ s= An. 

Finally, suppose that \U(G, P', k) | ^ An. Then, by the pigeonhole principle and without 
loss of generality, there exists 2 ^ t ^ r and a subset X C [L] such that, for each j G X, 
C/j = U . . . U A t , where A 1} A 2 C 3' 2 , Aj C for each 3 < j ^ t, and 

-»• Ai U . . . U Aj 

for each 1 ^ j ^ t — 1, and \X\ ^ A2n, where A2 depends only on k and r. 

We have at most n n choices for the partitions (Si, . . . , S r ) and (S[, . . . , S^), and at most 
2(e+v / 5+7)« 2 choices for the edges inside the sets Sj. We have at most 2 an2 choices for the 

(r— 1\ 2/2 
^ , - 2 /r choices for the edges 

between S"- and Sj for i, j 7^ 1. Finally, we have at most i n l r{jl ~ n l T ^~ Xzn2 choices for the 
edges incident with S[, by Lemma [T5l 

Thus, choosing a sufficiently small, we obtain 



log 2 (|Z4|) < ( (r ffi 2) + ! 72 i )^ 2 -A 4 n 2 = (l-i-A 4 )n 2 , 

for some A 4 > 0, as required. □ 

Remark 1. Note that we in fact only needed B(G, P, a) ^ w v ~ 2a for almost every graph 
GeV. 

The proof of Theorem [1] now follows easily. 

Proof of TheoremUl Let V be a hereditary property of graphs with Xc{V) = r, and let 
/c = fc(P) G N be sufficiently large, and a = a(V, k), 7 = 7 CP, a, fc), 5 = 5(P, 7, a, fc) > 
and e = e(V, 7, 8, a, k) > be sufficiently small. By Lemma [16l almost every graph 
GeV has a BBS-partition for (e, 5, 7). So let GeV, let P be a BBS-partition of G for 
(e,5,7), an d let P be a maximal (2a)-bad set for (G,P). 
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Now, by Lemmas [2D] and [2T], f° r almost every such G there exists an a-adjustment 
P' = (S[, . . . , S' r ) of (G, P) with respect to B. Let U(G, P', k) denote the set given by the 
algorithm. By Lemma |23| U(G,P',k) ^ n x ~ a for almost every such G. 

Let A = B U U(G, P', k), and let Sj := S$ \ A for each j G [r]. Then Sj is U(k)-hee for 
each j G [r] , and Theorem [1] follows. □ 

8. A SHARPER BOUND FOR BIPARTITE GRAPHS 

In this section we refine the methods of the proof of Theorem [2j giving a close to sharp 
upper bound on 

f(n,n,U(k)) := | {G bipartite on A U B : \A\ = \B\ = n and G is C/(fc)-free}|. 

The exponent in our bound will be within a polylog-factor of the best that we could 
hope for, i.e., the exponent in the extremal result of Alon, Krivelevich and Sudakov [3] 
(see the more general Theorem 6.1 in their paper), that a [/(fc)-free graph with class sizes 
n cannot have more than (^(n 2-1 /^ 1 - 1 ) edges. 

We remark that to remove all of the seemingly unnecessary log-factors looks extremely 
hard. Corresponding results are known only for the monotone case, and even then only 
in special cases (for C4 by Kleitman and Winston [25], for Cq and C§ by Kleitman and 
Wilson [21], for {C4, Cq, . . . C^} by Kohayakawa, Kreuger and Steger [26], and for K Syt 
by Balogh and Samotij [T5] [Tl]). 

Theorem 24. For every 3 ^ k G N, and every sufficiently large n, we have 

f(n,n,U(k)) < exp(n 2 - 1/ ( fc-1) (]ogn) fc+1 ). 

We first prove the following lemma, the proof of which uses the methods of Section [6j 
Let G be a U(k)-iree bipartite graph with classes U and V where \U\ = \V\ = n. For each 
u,v G £7, we define A(u,v) := \T(u)AT(v)\, the 'distance' between the two vertices. 

Lemma 25. Let m,n,x,k G N, and let G be a U(k)-free bipartite graph with classes U 
and V , where \U\ = m and \V\ = n. Let U' C U , and suppose that, for any u, v G U' , we 
have A(u,v) ^ x. Then 

\U>\ ^ Qy^&^ogm)"- 1 . 

Proof. The proof is very similar to that of Lemma [TU1 Indeed, let c = \U'\, and with 
3 log c 

foresight, let p = . We claim that there exists a subset X C V, with \X\ = pn, 

x 

such that 

v(u)nx ^ T(v)nx 

for each u,v G U' with 11/0. Indeed, if we choose a random subset X C V of size pn, 
then 
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for each such pair {x, y}, and so 

e g r ') : r(x) ni = r( y ) n x 

Thus such a set X must exist, as claimed. Now, if 

then, by Sauer's Lemma, there exist sets U" C U' and X' C X, with \X'\ = k, such that 
U" — > X'. But this is a contradiction, since G is {/(fc)-free. Thus 

lrr/l ^ ^ /3nl0gc\ fc 1 ^ ofc/™^ 1 /, 

|l/'| = c<2( — -) <3'(-) (logm) 

as required. □ 

We are now ready to prove Theorem [2^1 

Proof of Theorem \2^\ Let \U\ = \V\ = n, and suppose that G is a bipartite [/(fc)-free 
graph with classes U and V. We are required to show that the number of choices for the 

edge set of G is at most exp \n 2 ~ 1 /( k ~ l \logn) k+1 \ 

The idea is to partition U into t + 2 parts, and consider the edges from each part to V 
in turn. Indeed, let t G N, let n > x > ■ ■ ■ > x t > n 1 ~ 1 ^ k ~ 1 \ and let 

U C . . . C U t C U 

be maximal subsets satisfying A(u,v) ^ Xi for each u,v G Ui and ^ i ^ t. Moreover, 
and with foresight, let t = 10 log log n, and let 

Xi := ni-V^-iHVtfc-i)^ 2 

for each ^ « ^ t. Note that we have at most 2 n choices for the sets U. 
Now, by Lemma 1251 

fe-i 

- . 3 fc (bgn) fc_1 

\ X i J 

for each ^ % ^ t, so we have at most 

exp (—^j 3 h (\ogn) h ~ 1 

choices for the graph G[U , V). Now let ^ i ^ t — 1, and assume that the sets t/j and 
Ui+i and the graph V] have already been chosen. 

Claim: We have at most 

fe-i 



exp ( ( ) 3 fc+i (logn 



choices for the edges between U i+ i \ Ui and V. 
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Proof of claim. Since U is maximal, for every v G Ui + % \ U there is a u G U$ such that 
A(u,v) < X{. Thus, the number of choices for the edges between v and V is at most 



2\U 



and so the number of choices for the graph G[Ui + i \ Ui, V] is at most 

2m {x)T +l1 ^ (^ +1 ) (nM+l)fc " l3fc(1 ° gn)fc " 1 < exp V(logn)* 

as claimed. □ 

Finally, given U t and G[U t , V], the number of choices for the edge between U \ U t and 
V is at most 



2\U, 



< exp ((x t + l)nlognj < exp (2n 2 ^lognj. 



since for each u there is a vertex v G Ut such that A(u,v) ^ Xt- There are at most n 
choices for v and 2(") choices for the symmetric difference. 

Putting these bounds together, the number of choices for the graph G[U, V] is at most 

e X p L Voogn)*- 1 + 2n 2 - 1 ^ k - 1 hogn ^(logn)* 



=o 

\i+2 



But, recalling that Xi = n l 1 )+ 1 /( fc 1 ) 1 , we have 

,2-V(fc-l) 



)k— 1 / \ fc— 1 

/ n 
= Xi I = ir 



for each ^ i ^ t — 1. Since t = O (log log n), this gives an upper bound of 

exp f(7rc 2 ~ 1/(fc_1) (logn) fc loglog7i 
as required. □ 

We cannot hope to obtain very sharp results from such a (relatively) simple application 
of Sauer's Lemma. However, our results are close to optimal, if we do not care about 
poly-log factors in the exponent. For example, Theorem [2^1 for k = 3 gives an r?l 2 in the 
exponent, and 3/2 is best possible, as U(3) contains a C4. In general, U(k) contains a 
graph with average degree at least 2{k — 3), and so we have the bounds 

exp (n 2 - 1/(fc - 3) ) < f(n,n,U(k)) ^ exp (n^^-^logn)^ 1 ). 

It would be interesting to eliminate (if possible) the log n-f actors from the exponent; 
this would yield Kleitman- Winston-type results for many different bipartite graphs. 



THE STRUCTURE OF ALMOST ALL GRAPHS IN A HEREDITARY PROPERTY 



27 



9. QUESTIONS 

The most obvious disadvantage of Theorem [I] is that we know almost nothing about 
the structure of a typical U (fc)-free graph. 

Question 1. What is the structure of a typical U(k)-free graph? In particular, are almost 
all U(k)-free graphs either dense or sparse? 

As we remarked in the Introduction, there has recently been some important progress 
on hereditary properties of hypergraphs. In particular, we noted the following theorems 
of Dotson and Nagle [18], Ishigami |22] and Person and Schacht |32j . 

Given a hereditary property of fc-uniform hypergraphs V, the extremal number of V is 
defined to be 

ex(n, V) := max j : A C > and tnere exists M C y^j \ ^ sucn tnat 

M U A' G V for every A' C A \. 



In other words, it is the maximum dimension of a subspace of V n , in the product space 

{o,i}( [ ^). 

Theorem 26 (Dotson and Nagle [18], Ishigami [22]). Let k G N and let V be a hereditary 
property of k-uniform hypergraphs. Then 

i v I _ 2 ex ( n .^)+°( nfc ) 

Theorem 27 (Person and Schacht [32J). Almost every Fano-plane-free 3-uniform hyper- 
graph is bipartite. 

These results give reason to be optimistic that the following question, which until 
recently would have seemed very far out of reach, may now be approachable. 

Question 2. What is the structure of a typical member of a hereditary property of k- 
uniform hypergraphs? 

Finally, we note that Theorem [T] is considerably weaker than Conjecture [Q since the 
set A can be very large, and because our bounds on k are likely far from best possible. 
The following problem asks for some progress towards the conjecture. 

Problem 1. In the statement of Theorem^ improve the upper bound on \A\, and give 
good bounds on the constant k(V). 
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